Rank in Wordlist | Frequency | Word |
---|---|---|
5319 | 156 | 1,000 |
5814 | 136 | 10,000 |
6152 | 125 | 5,000 |
7396 | 96 | 2,000 |
8545 | 77 | 3,000 |
8834 | 73 | 4,000 |
8907 | 72 | 20,000 |
8996 | 71 | 50,000 |
9627 | 64 | 4,000mAh |
10612 | 55 | 40,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
12686 | 41 | 100% |
13278 | 38 | 10% |
13282 | 38 | 5% |
14727 | 32 | 50% |
16228 | 27 | 25% |
16562 | 26 | 20% |
17688 | 23 | 90% |
18596 | 21 | 40% |
19129 | 20 | 60% |
19677 | 19 | 30% |
Rank in Wordlist | Frequency | Word |
---|---|---|
19699 | 19 | IL&FS |
26484 | 11 | J&K |
33421 | 7 | S&P |
62935 | 2 | M&M |
63153 | 2 | Q&A |
85026 | 1 | 282&290 |
89698 | 1 | BB&T |
90710 | 1 | I&B |
91041 | 1 | L&T |
91785 | 1 | R&D |
Rank in Wordlist | Frequency | Word |
---|---|---|
49686 | 4 | सा$फ |
126978 | 1 | फारू$ख |
134791 | 1 | मु$गल |
150151 | 1 | हफी$ग |
Rank in Wordlist | Frequency | Word |
---|---|---|
124 | 9017 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
196 | 5902 | .' |
29216 | 10 | है'। |
29624 | 9 | ओ'ब्रायन |
36211 | 6 | India's |
45567 | 4 | कहा,'इस |
45568 | 4 | कहा,'मैं |
48906 | 4 | रामधारी सिंह 'दिनकर' |
51387 | 3 | O'Dwyer |
51517 | 3 | Teacher's |
62795 | 2 | I've |
Rank in Wordlist | Frequency | Word |
---|---|---|
26441 | 11 | 4GB+64GB |
31039 | 8 | 2+2 |
31072 | 8 | 3GB+32GB |
48112 | 4 | बीजेपी+शिवसेना |
50923 | 3 | 6GB+128GB |
51492 | 3 | Shift+Alt |
58432 | 3 | रैम+64जीबी |
60916 | 2 | 13MP+2MP+2MP |
60917 | 2 | 13MP+8MP+2MP |
61583 | 2 | 3GB+64GB |
Rank in Wordlist | Frequency | Word |
---|---|---|
10489 | 56 | 26/11 |
10737 | 54 | 9/11 |
14265 | 34 | एससी/एसटी |
15568 | 29 | 4GB/64GB |
17308 | 24 | https://t |
19130 | 20 | 6GB/128GB |
19711 | 19 | f/2.4 |
20907 | 17 | 6GB/64GB |
21602 | 16 | 8GB/128GB |
22370 | 15 | f/2.2 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots